Power-law distributions from additive preferential redistributions 
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Abstract 

We introduce a non-growth model that generates the power-law distribution with the Zipf exponent. 
There are N elements, each of which is characterized by a quantity, and at each time step these quantities 
are redistributed through binary random interactions with a simple additive preferential rule, while the sum 
of quantities is conserved. The situation described by this model is similar to those of closed iV-particle 
systems when conservative two-body collisions are only allowed. We obtain stationary distributions of these 
quantities both analytically and numerically while varying parameters of the model, and find that the model 
exhibits the scaling behavior for some parameter ranges. Unlike well-known growth models, this alternative 
mechanism generates the power-law distribution when the growth is not expected and the dynamics of the 
system is based on interactions between elements. This model can be applied to some examples such as 
personal wealths, city sizes, and the generation of scale-free networks when only rewiring is allowed. 
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I. INTRODUCTION 



Power-law distributions have been observed in diverse fields for more than a century yj]. Some 



a. 



3. 4jL word frequencies Q], 



well-known examples exhibiting 'scaling' behavior are city sizes J2I 

sizes of business firms Q], personal incomes J^H^], personal wealths |9,|lo[[liL sizes of web sites 
I12I1 . numbers of links of web pages Jo! , connections of routers in the Internet J14I . species in 
genera I15I . interactions of proteins Jl6l . citations of scientific papers [17] and so on, covering 
many research fields such as biology, economics, sociology, engineering, and physics. Many gen- 
erative models have been introduced so far to explain this ubiquitous phenomenon fujl . and most 
of them use simple mechanisms that give rise to the power-law distributions. One group of models 
uses stochastic multiplicative processes J2I Q, [3], and another group uses preferential growing 
mechanisms flQ.Sl- These models have their root in the Gibrat's law of proportional growth 
0, and are based on two assumptions: the growth of the system and non-interaction between 
elements. There are also non-growth models in which the main mechanism is the interaction be- 
tween randomly chosen elements, resulting in the multiplicative changes of values 0,0, HQ. 
Systems showing the scaling behavior consist of iV elements (N may vary with time), while each 
element % (1 < i < N) is represented by the quantity k { , and the probability of an element having 
the value k, P(k), has the form k~"' for a given range of k. 

Here we introduce a non-growth model exhibiting the power-law distribution with the Zipf 
exponent (7 = 2), in which quantities of elements are redistributed through binary random in- 
teractions with a simple additive preferential rule. The model assumes that N and the sum of all 
fcj's are conserved, and that, when two elements i and j are chosen randomly at a given time, ki 
and kj will be changed additively while preserving k{ + kj. This model can be a mechanism that 
explains scaling behavior of many socio-economical systems, especially when the growth is not 
expected and interactions between elements are vital to their dynamics. Moreover, this model can 
be extended to generate scale-free networks through rewiring only, because the rewiring process 
by changing an end point of a link changes degrees of two nodes additively while preserving the 
sum of degrees of all nodes. 

In this paper, the model and its stationary distributions are investigated both numerically and 
analytically. In Sec. II, the model is described in detail. In Sec. Ill, the master equation is obtained. 
Stationary distributions are found numerically first, and then analytically solved. And the condi- 
tion for the power-law distributions in the parameter space is also found using both numerical and 
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analytic methods. In Sec. IV, three possible applications of this model are discussed. Finally, in 
Sec. V, we summarize our results. 



H. MODEL 

Let us introduce our stochastic model in detail. The model assumes that fcj's are non-negative 
integers, and we define a as the average quantity per element, 

oo 

a = E kP(k) = Ik). (1) 

k=0 

At each time step T, two elements, i and j, are randomly chosen, and the element i gives one 
unit of the quantity to the element j with the exchange probability R; hence their quantities are 
changed additively, fcj — > fc* — 1 and fcj — > kj + 1, while fcj + fcj is conserved (as a result, a becomes 
a conserved quantity). In other words, i is the giver and j is the taker, while the probability of non- 
exchange is 1 — R. One simple way to give an advantage to an element with bigger k is letting R 
dependent on k { and kj as below, 



R 



1 (0 < h < kj) 

(3 {h > k 3 ) (2) 

o {h = o) 



where (3 is a constant in the range of < (3 < 1. In this model, we can represent the system 
with three independent parameters: N, a and (3. When a distribution is given initially at T = 0, 
P{k) will evolve as T increases, and eventually reach a stationary distribution. To express the 
cumulative distribution, we also define P(> k) = T^ =k P{k') 

The parameter (3 plays an important role in this model. Two special cases of (3 = and (3 = 1 
have been previously discussed in the context of conserved exchange processes 

flHQ. The 

focus of this paper, however, is the general case of < (3 < 1. When (3 < 1, the time-reversal 
symmetry of the dynamics is broken, and at the same time the elements with bigger k ('the rich') 
get an advantage over those with smaller k ('the poor'). Then this model becomes one of rich-get- 
richer mechanisms, which will generate broad stationary distributions. In the next section, we will 
show that the stationary distribution from this model exhibits the power law when (3 is less than a 
certain critical value, and that this critical value will depend on the value of a. 
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III. STATIONARY DISTRIBUTIONS 



First we look at the dynamics of an element with the quantity k to focus on the evolution of an 
element. To gain one unit, an element should be chosen as the taker with the probability 1/N, and 
the probability of an element gaining one unit, T + (k), depends on the choice of the giver. Similarly 
the probability of an element losing one unit, T_(/c), when chosen as a giver, can be found, 

T + (k) = [l-P(0)-(l-(3)P(>k + l)}, 

T.(k) = (l-S k0 )[p+(l-(3)P(>k)}. (3) 

Then, for an element, the expected change of k after a time step, Ak, is [T + (k) — T_(fc)]/iV. Since 
Ak is not proportional to k, the Gibrat's law is not satisfied. In a sense, each element is performing 
the random walk if we regard k as the position, while the transition probability found in Eq. © is 
asymmetrical, position-dependent and time- varying. 

If we use the continuous approximation as N — ► oo, the master equation can be obtained, 

AP(k) = [P(k - l)T + (k - 1) - P(k)T_(k)} - [P(k)T + {k) - P(k + l)T_(Jfe + 1)], (4) 

where AP(k) is the expected change of P(k) after one time step. Then from the condition for the 
stationary distribution, AP(k) = (Vfc, k > 0), we find that stationary distributions should satisfy 
these nonlinear equations, 

p(k + D- T+(fc) p(k) - i--r(Q)-(i-OT>fc + i) P(k) (5) 

for k > 0, because, in Eq. ©, there are two parts, two terms each, and each part should be zero 
when AP(k) = 0. Even though we can theoretically find P(k) as a function of a and (3 using Eqs. 
© and ©, these nonlinear equations are not easily solved analytically except for some special 
cases. 



A. Case of (3 = 

This is a trivial winner-take-all situation. When [3 = 0, the rich will always win for every binary 
interaction. Even without solving Eq. ©, the stationary state and its distribution are trivially 
found. In the stationary state, one element has all quantities, k = aN, and the other elements have 
no quantity, k = 0; therefore the stationary distribution is 

N- 1 1 

P{k) = — 5 k0 + j^S kjaN . (6) 
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As TV — > oo, P{k) becomes 5ko approximately. 
B. Case of (3 = 1 

When (3 = 1, the model describes the conserved random exchange process, which was already 
discussed previously 1^,13]. From Eq. ©, we easily find P{k) exactly as [1 — P(0)] fc P(0). After 
substituting P(k) into Eq. © to find -P(O), we obtain the stationary distribution, 



As a — > oo, P{k) becomes (1/a) exp [—k/a], which is the Boltzmann-Gibbs distribution. 
C. Case of < (3 < 1 

In this general case, the rich have an advantage over the poor, but lose to the poor from time to 
time. This property keeps the stationary distribution balanced somewhere between those from two 
extreme cases discussed above. Since the analytic method cannot be solely used in this case, the 
model is numerically investigated first. 

After performing extensive numerical simulations while varying N, a, and (3, we found that 
the stationary distributions exhibit the power law when a and (3 satisfy a certain condition; that 
is, it is scaling when (a, (3) is inside a region in (a, /3)-space, represented by a condition such as 
f(a, (3) < e, where f(a, (3) and e will later be found in this section. In Fig. [2 we show a scaling 
case of N = 10 5 , a = 1, and (3 = 0.1. As time increases, the initial distribution, Ski, evolves to a 
power-law stationary distribution, which is shown using the cumulative distribution, P(>k). (In 
all simulations here, a is a positive integer, and the initial distributions of the delta-function form, 
5ka, will be used.) 

One common property that stands out in all scaling cases like the one in Fig. [T|is that P(0) ~ 
1 — (3 whenever the distribution is scaling. This property can be analytically proved by solving 
Eq. © when P(0) is given as 1 - (3. Since P(> k + 1) = 1 - YX>=o p ( k ')> p ( k + 1) can be 
represented as a function of P(0), . . ., P{k), and j3. When k = 0, P(l) can be found as a function 
of P(0) and (3, and when k = 1, P(2) can also be found as a function of P(0) and (3 using P(l) 
obtained already. If we repeat this process, {P(k)\k > 1} will all be found as a function of P(0) 
and (3. When we substitute P(0) = 1 — (3, found numerically in scaling cases, we can obtain P{k) 




(7) 
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FIG. 1: Evolution of the cumulative distribution P(> k) when N = 10 5 , a = 1, and /3 = 0.1 in a log-log 
plot. From an initial distribution, P(k) = Ski, we observe how P(> k) evolves as the number of timesteps 
T varies from to 10 9 (OX 10 10 (□), 10 11 (O), 10 12 (A). We can observe that P(> 1) ~ (3(= 0.1), which 
leads us to -P(O) = 1 — P(> 1) ~ 1 — f3. The dashed line represents the theoretical stationary distribution 

P(>k) = l/(9/c + l)at/? = 0.1. 

and P(>k) in closed forms as below, 



This is the Zipf's law, P(k) oc k~ 2 and P(> k) oc k" 1 , valid when k{l — (3) is big enough. The 
shape of P(k) in Eq. © does not depend on a, but as we will show later a will play a significant 
role in deciding whether the system is scaling or not. 

As shown above, the relation P(0) = 1 — (3 is the condition for the scaling stationary distri- 
butions. In other words, a scaling condition for a and (3 can be found if we find a condition with 
which the condition -P(O) = 1 — (3 holds. To observe when the relation -P(O) = 1 — (3 holds, we 
find P(0) for various a and (3 values using numerical simulations. In Fig. O we show P(0) versus 
a, and P(0) versus (3 when N = 10 4 . When (3 is given, P(0) equals to 1 — (3 when a is greater 
than a certain critical value, at c ((3), and when a is given, -P(O) equals to 1 — (3 when (3 is less than 
a certain critical value, f3 c (a) [(3 = f3 c (a) is the inverse functions of a = a c ((3)]. Therefore we 
find that a critical relation exists for the system to exhibit the scaling behavior, and the boundary 
between the scaling region and non-scaling region is represented by a = a c (f3). 

How do we estimate this critical boundary in (a, /5)-space analytically? One possible argument 
uses the highest k value, k M - Since N is finite in the model, the power law will be valid only for a 



P{k) 




P(>k) 



1 -/?[* + 1/(1 -/?)][* + 0/(1 -/?)]' 
(3 1 



(8) 



1- (3k + (3/(l- (3)' 
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FIG. 2: When N = 10 4 and T = 10 9 ~ 10 10 , P(O) values are found numerically for various a and 
values (averaged over 10 runs), (a) P(0) versus a when (3 = 0.1 (OX 0.3 (□), 0.5 (O), 0.7 (A), 0.9 (*). 
.P(O) is close to 1 — (3 when a is greater than a certain value for a given j3. (b) P(0) when j3 for a = 1 (0)> 
5 (□), 10 (O), 100 (A). P(0) is close to 1 — f3 when (3 is less than a certain value for a given a. Moreover, 
we observe that P(0) = 1/(1 + a) from Eq. Q are satisfied when (3=1. The dashed line represents 
P(0) = 1-/3. 

finite range of k, and the position of cutoff, k M , depends on N, a, and (3. Especially when Eq. ® 
is satisfied (scaling cases), we can estimate ku by solving the equation below, 



a 



E kP(k) 

k=0 







I -13 Jo 



dk 



k 



(9) 



[k + l/(l-p)][k + P/(l- 

which is a modification of Eq. © by letting P(k) = when k > k M (reasonable because P(k) 
obtained from the continuous approximation is not valid when N is finite and k is high). By 
solving Eq. ©, the estimated value of k M for scaling cases is 

a 



k 



M 



■ (3 1 -p exp 



13/(1- P) 



(10) 



Then we can find the ratio of the number of elements that are supposed to be in the region k > ku 
to the total number of elements N, which can be obtained from P(> k) at k = kM, 

(3 1 



P Ok 



Mi 



1 - (3k M 
(3^ exp 



-a 



f(a,(3). 



(11) 



_(3/(l-(3)_ 

If the ratio, f(a, (3), is small enough, these elements that were supposed to be in k > kM can 
be regarded as additional elements with k = 0, changing P(0) into P(0) + f (a, (3), and they will 
not disrupt the stationary power-law distribution. However when f(a, (3) is not small, the whole 
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FIG. 3: The scaling condition in (a, /3)-space. The dashed line represents f(a, (3) = e when e = 10~ 3 . As 
a increases, the range of for the power law approaches < j3 < 1. 

distribution can be disrupted [see how -P(O) affects all elements in Eq. ©], and the distribution 
will settle into another type of stationary distributions, which decay much faster than scaling ones 
do. Therefore, the scaling condition can be written as f(a,{3) < e where < e <C 1. In Fig. 
13 we plot this condition when e = 1CT 3 , estimated from the simulation results. The critical 
boundary, a = a c ((3), that separates the scaling region from the non-scaling region was obtained 
from /(a, (3) = e, 

MP) = Y^^[ e ~ 1 ^]- (12) 

The scaling region shown in (a, /5)-space corresponds well with results in Fig.|2j and is surpris- 
ingly big. If a is big enough, the system exhibits the power law for almost any value of (5, which 
means that just a slight advantage given to the rich is enough to make the system follow the Zipf 's 
law. In Fig. HJ we observe several cases with various parameter values using numerical simula- 
tions. In Fig.Uta), we fix a at 5 and vary (3, to observe that /3 C (5) ~ 0.5. In Fig.Htb), we vary 
a and (5 to observe that (3 c (a) increases as a increases. When (a, /3) is in the power-law region 
[P < P c (a)], the shape of the stationary distribution is determined by (3 only, and a only changes 
the position of the cutoff, k M . On the other hand, the shape of the stationary distribution will be 
determined by both a and (3 for non-scaling cases [(3 > (3 c (a)]. 
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FIG. 4: Stationary distributions for various {a, (3) values. Dashed lines represent theoretical stationary 
probability distributions for given (3 values, and data points are logarithmically binned for scaling cases, (a) 
When N = 10 6 and T = 7 x 10 11 , a is fixed at 5, and (3 = 0.2 (O), 0.4 (□), 0.6 (O), 0.8 (A), 1.0 (*). The 
power-law distributions are observed clearly when (3 = 0.2 and 0.4. (b) Examples of various (a, (3) values 
exhibiting power-law distributions after T = 10 12 : N = 10 6 , a = 1, (3 = 0.2 (O); N = 10 5 , a = 10, 
f3 = 0.4 (□); N = 10 4 , a = 100, (3 = 0.6 (O); N = 10 3 , a = 1000, (3 = 0.8 (A). 

IV. POSSIBLE APPLICATIONS 

So far we have proposed a simple model using general terms such as elements and quantities. 
Here we discuss three examples where this mechanism can be applied. 



A. Personal wealth 



The first example is the wealth distribution with people and their assets, which is known to 
exhibit the power law especially for the richest people. In a society, the population does not grow 
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always, and their total amount of assets can be assumed to be conserved. People also interact in 
many ways, changing their assets, and the rich have an advantage over the poor. In our model, a 
becomes the average amount of asset per person, and (3 is the parameter representing the advantage 
for the rich. Because a is usually big enough, the power-law with Zipf exponent, 7 = 2, will 
emerge for almost any value of (3, while empirical data shows 7 ~ 2.091 |1]. There are other 
non-growth models for the power-law wealth distributions, which use the binary interactions of 
the traders 



B. City sizes 



The second example is the distribution of city sizes with cities and their sizes. This is the 
original Zipf's law, and the Zipf exponent has become famous for this phenomenon (originally 
Zipf used the rank statistics and the exponent is 1, which is equivalent to 7 = 2 in our case). Our 
model can be applied to this case when the number of cities is fixed, and the overall population does 
not grow. Here an interaction is the migration of a person (or a family) from one city to another. 
People tend to move from a small city to a larger city; hence, (3 is the parameter representing this 
tendency. Then, the Zipf's law will emerge from our model. It will be unrealistic if P(0) is not 
close to because there is no empty cities usually. But when a is big enough and (3 is close to 
1, the distribution will be still scaling and -P(O) will be close to 0. Even with a drawback of not 
taking account of the growth of cities from within unlike other models [3], this mechanism has 
some merits to be regarded as another valid explanation of the Zipf's law: (1) the model produces 
the Zipf exponent naturally with a simple mechanism, (2) the migration of people between cities is 
well-represented by the model, (3) the attractiveness of the bigger cities is also well-represented. 
There also can be a different approach. For example, Zanette and Manrubia used the stochastic 
linear model, which assumes neither the growth nor the binary interactions. 



C. Scale-free networks 



The last example is the network with nodes and their degrees. A network is an entity that 
consists of nodes and links, while the degree of a node is the number of links connected to a 
given node. In many systems represented by networks, degrees of nodes have been found to 
follow power-law distributions (hence called scale-free networks). Based on the mechanism of 
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linear preferential attachment proposed by Ref. [20], many extended models have been followed 



25, 



26 



2711 . In these models, the assumption of growth of nodes and links is crucial, and 



interactions between nodes are either ignored or used as an extra feature 12611 . This approach is 
valid for many scale-free networks, but not suitable for non-growing networks, in which node 
interactions are vital to their dynamics. Our model can generate this kind of scale-free networks 
by interpreting the interaction between nodes as the rewiring process. When nodes, i and j, are 
chosen, the rewiring process changes the link from (i', i) to where i' is a pivot node chosen 
from nodes that are linked to i (loops and multiple links are allowed). Therefore, from our model, 
networks with power-law degree distributions can be generated through only rewiring, and the 



ated to 



results will be presented in a forthcoming article [ 28] . The network concept is actually re 
many scaling phenomena, since they can be represented by networks directly II13L Il4l ]l 
or indirectly [29]. 




mm 



V. CONCLUSIONS 



We have proposed a preferential-redistribution mechanism that generates power-law distribu- 
tions with the Zipf exponent for certain parameter ranges, and this scaling region in our parameter 
space has been found analytically using some numerical results. Since this scaling region is big 
enough and the mechanism is very simple, our model can be a good candidate to be used as a base 
mechanism for models describing some scaling phenomena, and three possible applications have 
been discussed here. Like other models, our model has limited applicability, but we believe that 
it can be extended to suit specific needs as a part of more realistic models, or generalized to have 
more flexible features. 
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